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Abstract. We study random graphs, both G{n,p) and G{n,m), with random orien- 
tations on the edges. For three fixed distinct vertices s, a, b we study the correlation, 
in the combined probability space, of the events {a — s} and {s — ^ 6}. 

For G{n,p), we prove that there is a pc ~ 1/2 such that for a fixed p < Pc the 
correlation is negative for large enough n and for p > pc the correlation is positive for 
large enough n. We conjecture that for a fixed n > 27 the correlation changes sign 
three times for three critical values of p. 

For G{n,m) it is similarly proved that, with p — m/{^, there is a critical pc that 
is the solution to a certain equation and approximately equal to 0.7993. A lemma, 
which computes the probability of non existence of any I directed edges in G{n,m), is 
thought to be of independent interest. 

We present exact recursions to compute P(a s) and P(a — )■ s, s — > b). We also 
briefly discuss the corresponding question in the quenched version of the problem. 



1. Introduction 

For a graph G = {V, E) we orient each edge with equal probabihty for the two possible 
directions and independent of all other edges. This model has been studied previously 
in for instance [1, 6, 12]. Let s,a,b be fixed distinct vertices in the graph. Let {a — )■ s} 
denote the event that there exists a directed path from a to s. The object of this paper 
is to study the correlation of the two events {a — >■ s} and {s ^ 6} in a random graph. 
One might intuitively guess that they are negatively correlated in any graph, that is 
P(a — !> s, s — )■ 6) — P(a — ?> s) P(s — ?> 6) < 0. This is however not true for all graphs. In 
fact, the smallest counter example is the graph on four vertices with all edges except 
{a, b}. In [1] it was proved that for the complete graph Kn they are negatively correlated 
for n = 3, independent for n = 4 and positively correlated for n > 5. The results in [1] 
suggested that the correlation seemed more likely to be positive in dense graphs, which 
led us to further investigate the correlation in G{n,p) and in G{n,m) as presented in 
the present paper. 

There are two different ways of combining the two random processes. In this paper we 
deal with the annealed case, which means that we consider the joined probability space 
of G{n,p) (or G{n,m)) and that of edge orientations. We then consider correlations 
of events in this space. The other possibility is the quenched version, in which one for 
each graph in G{n,p) (or G{n,m)) computes the correlation of the two events in the 
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probability space of all edge orientations in that graph. Then one takes the expected 
value over all graphs. We discuss briefly the quenched version in Section 9. 

In the random graph G{n, p) there are n vertices and every edge exists with probability 
P, < p < 1, independently of other edges. Let G{n,p) be a directed graph obtained 
by giving each edge in G{n,p) a random direction. Our main result in Section 2 implies 
that for a fixed p the events are positively correlated when p > 1/2 for large enough 
n, but negatively correlated when p < 1/2 for large enough n. To be more precise, in 
Theorem 2.1 we prove that 

P(a 7^ s, s 6) - P(a s) P(s /> 6) 2p - 1 

— ; — > as n ^ cc. 

¥{ay^ s,s^b) 3 

Note that the covariance is bilinear so that ¥{a — )■ s, s — )> 6) — P(a — )■ s) ¥{s — )■ 6) = 
P(a 7^ s, s 7^ 6) — P(a -/^ s)P(s -/^ b). Thus we may instead study the complementary 
events {a -f^ s}, that there does not exist a directed path from a to s, and {s -f^ 6}, 
which turns out to be more practical. The change of sign for the correlation at p = 1/2 
was unexpected for us. An attempt to an heuristic explanation is given just after the 
statement of Theorem 2.1. 

In Section 5 we give exact recursions for the probabilities P(a -f^ s),P(s -f^ b) and 
P(a 7^ s,s 7^ b). This is done by studying the size of the set of the vertices that can be 
reached along a directed path from a given vertex s and the size of the set of vertices 
from which there is a directed path to s. We give recursions for the joint distribution of 
these sizes, which could be of independent interest. 

The situation seems however to be much more complicated than what is suggested 
by Theorem 2.1. In Section 6 we report our results from exact computations using the 
recursions. It turns out that for n > 27 the difference P(a — > s, s — )• 6) — P(o — )• s) ■P(,s — )• 
b) seems to change sign three times. First twice for small p (roughly constant /n) and 
then it goes from negative to positive again just before p = 1/2. We find this behavior 
mysterious but conjecture it to be true in general and plan to study this further in a 
coming paper. 

For < m < (2) , G(n, m) is the random graph with n vertices (which we take 
as l,...,n) and m edges, uniformly chosen among all such graphs. We let G{n,m) 
be the directed random graph obtained from G{n, m) by giving each edge a random 
direction. It is less straightforward to study the probabilities in G{n,m), but thanks 
to Lemma 3.2, we can prove the corresponding statement for G{n, m) in parallel with 
the proof for G{n,p). Here the critical probability is ~ 0.799288221, see Theorem 2.2. 
The exact computations and the simulations for G{n, ni) do not give reason to believe 
that there is more than one critical probability for a fixed n. It might be consternating 
that G{n,m) and G{n,p) behave so differently. In Section 8 we explain this and prove 
that the correlation in G{n,p) is always larger, in a certain sense, than in G{n,m). The 
difference is expressed as a variance for which we give an exact asymptotic expression, 
see Theorem 8.2. 



The background of the questions studied in this paper is as follows. Let G be any 
graph and a, b, s, t distinct vertices of G. Further, assign a random orientation to the 



CORRELATIONS FOR PATHS IN RANDOM ORIENTATIONS 



3 



graph as described above, that is each edge is given its direction independent of the 
other edges. In [12] it was proved that then the events {s — > a} and {s b} are 
positively correlated. This was shown to be true also if we first conditioned on {s -f^ t}, 
i.e. P(s ^ a,s ^ 5 I s 7^ > P(s ^ a I s 7^ • P(s 6 I s 7^ t). Note that 
it is not intuitively clear why this is so. It is for instance no longer true if we instead 
condition on {s — ?• t}. In another direction, it was proved that P(s h^a ^ t \ s ^ t) < 
P(s h \ s -f^ ■ P(a t\ s -f^ f). The proofs in [12] relied heavily on the results in 
[5] and [4] where similar statements were proved for edge percolation on a given graph. 
These questions have to a large extent been inspired by an interesting conjecture due to 
Kasteleyn named the Bunkbed conjecture by Haggstrom [8], see also [11] and Remark 5 
in [5]. 

Acknowledgment: We thank the anonymous referees for very helpful comments. 

2. Main Theorems 

Let G{n,p) be the random graph where each edge has probability p of being present 
independent of other edges. We further orient each present edge either way independently 
with probability ^ and call the resulting random directed graph G{n,p). It turns out 
to be convenient to study the negated events {a -f^ s) and {s 6}. We say that two 
events A and B are negatively correlated if 

Cov(^, B) := F{A n B) - F{A) F{B) < 0, 

and positively correlated if the inequality goes the other way. If A and B are negatively 
correlated, then A and ^B are positively correlated. Thus a -/^ s and s -/^ b are 
negatively correlated if and only if {a s} and {s b} are negatively correlated. 
Trivially, P(a 7^ s) = P(s 7^ b). 

All unspecified limits are as n ^ 00. 

The case p = 1 corresponds to random orientations of the complete graph Kn- In [1] 
it was proved that a -/^ s and s -/^ b are negatively correlated in K^, independent in 
and positively correlated in Kn for n > 5. It was in fact proved that the relative 
covariance (P(a 7^ s,s 7^ 6) - P(a -/^ s)P(s 7^ 6))/P(a 7^ s,s 7^ 6) converged to | as 
n ^ 00. 

For G{n,p) our main theorem is as follows. 

Theorem 2.1. For a fixed p G (0, 1] and fixed distinct vertices a,b,s£ G{n,p) we have 
the following limit of the relative covariance 

P(a 7^ s,s 7^ 6) -P(a 7^ s)P(s 7^ 6) 2p - 1 
hm — ■ — = . 

n->oo f>(^a s,s b) 3 

In particular, for large enough n: 

(i) If 0<p< 1/2, then Cov(s ^a,b^s)<0. 

(ii) // 1/2 < p < 1, then Cov{s y^a,b^s)>0. 

Furthermore, 

Cov(s 7^ a, 6 7^ s) = (2p - 1 + o(l))(l - p/2)2"-3. (l) 
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One way to heuristically understand the critical probability 1/2 is as follows. For 
the event {s -f^ a} the two cases that dominate for large n are i) no edge points out 
from s and ii) no edge points in to a, each with probability (1 — p/2)"~^. Similarly for 
{6 7^ s). Thus P(s 7^ a) P(6 -f^ s) is roughly 4 • (1 — p/2)^"~^. For the joint probability 
P(s 7^ a, 6 7^ s) on the other hand, we only need to consider three of these cases since no 
edges at all around s has a much smaller probability. This gives a negative contribution 
to the covariance and is perhaps the most striking contribution. But, the other three 
cases have in the joint probability a factor 1 —p/2 less since one edge is common, which 
gives a positive contribution. Thus the quotient p^J^^p^^g) ~ ' '^hich is greater 
than 1 if and only \i p > 1/2. 

Theorem 2.1 was proved in an earlier version of this manuscript, see [2], with a rather 
lengthy calculation to estimate the involved probabilities. In the present version we 
will present a shorter proof which also has the advantage that it allows us to prove the 
covariance also in G{n,m). However, we want to point out that the lengthy calculations 
in [2] could be shortened with a clever lemma by Backelin, see [3]. 

For G{n,m), we obtain the corresponding result. 

Theorem 2.2. Suppose that m,n ^ oo, and that rn/f^ — > p € (0, 1]. Let pc ~ 0.799 

2p(l-p) 

be the unique root in [0, 1] of 3e (^-p)^ = 4 _ 2p. If n is large enough, then for any 
distinct vertices s,a,6 € G{n,m): 

(i) If <p < Pc, then Cov(s 7^ a, 6 7^ s) < 0. 

(ii) If Pc < p < I, then Cov(s -/i a, b -/^ s) > 0. 

In fact we have the following asymptotic formula for the covariance. 

Theorem 2.3. Suppose that m,n ^ 00, letp = p{n) := m/ and assume liminf p(n) > 
0. Then, for any distinct vertices s,a,b & G{n,m): 

( 3 2p(i-p) \ 

Cov(s 7^ a, 6 7^ s) = e - 1 + o(l) P(s 7^ af 

V4-2p / 

= f3-e -4 + 2p + o(l) J • (1 -p/2)2"-3 .g l^z^ _ 

3. Two LEMMAS 

In order to prove the theorems for G(n, p) and G'(n, m) in parallel we need two lemmas. 
Lemma 3.1. Fix £ > edges in Kn- The probability that none of these i edges appears 
in G{n,m) is at most (l —'71/(2)) ■ 

Proof. Let N := (2). We may assume that m + i < N . Then the probability is 
iV) {N-£)\{N-my. t} N-m-^ ^'-^^ m^ 

(^) (N-e-m)\m J-J- N-i -J-J-v A^;- 

Note that in G{n,p) the corresponding probability is exactly (1 —pY by independence 
of the edges. We next show a more precise version of Lemma 3.1 for the directed graph 
G{n, m). 



g(£;n,m)~(l-p/2/exp(-(-) g ) • (2) 
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Let q{i) = q{i; n, m) be the probability that if we fix I edges in and give each an 
orientation, then none of these directed edges appears in G{n,m). (Here < ^ < (2). 
By symmetry, the probabihty does not depend on the choice of the ^ edges and their 
orientations.) Let also q'{i;n,p) be the corresponding probability in G{n,p). It is clear 
that q'{t,n,p) = (1 -p/2)f For G(n, m) we have the following result, which we believe 
is of independent interest. 

Lemma 3.2. Suppose that £ = l{n) = 0{n) and that < ?n = m{n) < (2). Then, with 
p = p{n) = m{n) / (2) > as n ^ 00, 

■£\^ p{i-py 

.n/ (2-p)^ 

Furthermore, for any £,n,m we have q{£;n,m) < q'{£;n,p) = (1 — p/2)^. 

Proof. Let := (2) • The cases in = and m = N are trivial, so we may assume that 
< m < and thus < p < 1. 

Let X be the number of the £ chosen edges that appear in G{n,m), ignoring orienta- 
tions. Then X has the hypergeometric distribution 

p. ^ _ _ ii) (^-fc) _ {N-£)\{N- m)\m\ 

^ ' Q \kj m{N -m-l + ky.{m-k)V ^' 

Define, for J, j > 0, a{J,j) := J~^J\/{J — j)l (interpreted as when j > J). Then (3) 
can be written as 

W{X = k) = (^^a{N,£)-^N-^a{N -m,£-k){N - mY-^a{m,k)m^ 

^)/(l - pY~^a{N,£y^a{N -m,£- k)a{m, k). 

Given X = k, the probability that none of the selected directed edges appears in G(n, m) 
equals the probability that none of the k selected edges that appear in G(n, m) gets the 
forbidden orientation, which equals (1/2)*^. Hence, 

q{i) = ^2~'¥{X = k) 

k=0 

= E (l) (f ) - ^')'"'«(^' ^)"'«(^ ^)«("^' ^) 

fc=0 ^ ^ 

= a{N, - p/2YE{a{m, Y)a{N -m,i-Y)), 

where Y has the binomial distribution Bin(£,p/(2 —p)). Consequently, (2) is equivalent 
to 

a{N, £y' E(a(m, y)a(iV -m,£-Y))r^ exp (- Q'^^^ . (4) 
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In order to show (4), it suffices to show that every subsequence has a subsequence 
where (4) holds. (See [10, p. 12].) Consequently, by choosing suitable subsequences, we 
may assume that 

l/n^X and p — > vr, (5) 
for some A G [0, oo) and vr € [0, 1]; furthermore, we may assume that either 

£^oo or l = 0{l). (6) 
We next observe that for all J > 1 and j < J, 

j) = J''T{{J-i) = T{[^-^)=^^v[Y. iog(i - j) • 

i=0 i=0 \i=l / 

It follows that < a{j,J) < 1 for all j and J. Moreover, if j = 0{J^^^), then, if also 
j<J, 

1=1 i=l ^ ^ 

and thus (also in the trivial case j > J) 

a(J,j)=exp(-|^)+0(j-i/2). (7) 

In particular, (7) applies to a{N,i), because i = 0(n) = 0{N^^'^), and thus 

a{N, I) = exp {-e/2N) + o(l) ^ exp(-A2) , (8) 

assuming (5). 

Next, consider a(m, y), assuming (5)-(6). Assume first vr > 0. If also I oo, then 
by the law of large numbers, Y/l vr/ (2 — vr), and thus 

nV2 p / vr x2 1 vr , 

2m UMn/ m V2 - vr/ vr (2-vr)2^- 

On the other hand, if ^ = 0(1), then A = 0; furthermore, Y < H. = 0(1) and m ^ oo 
(because vr > 0), so Y'^/2m 0; hence (9) holds in this case too. 
In the case vr = 0, so p — ?> 0, we instead use 

EY = ip/{2 -p)<p£ = o{p^^^£) , 

and thus Y/{p^/'^i) 0. Hence, 

Y' f Y \' pf ( Y y e p,,_ vr 



2m Vp^/^^ / 2m Vp^^^^ / "(^ -1) (2 - vr)^ 

so (9) holds in this case too. 

Consequently, (9) holds in every case. In particular, Y = Op(m^/2), and (7) implies 
that, provided m ^ oo, 

a(m,y) = exp (^-|^) + Op(l) ^ exp (- -vr)^ ^') ' 
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on the other hand, if m = 0(1), then ¥{Y = 0) — )> 1, e.g. by (9), so (10) holds in this 
case too. Consequently, (10) holds in every case. 

Since i - Y Bm{i, 1 - p/{2 - p)) = Bin(£, 2(1 - p)/{2 - p)), it follows by similar 
arguments, now considering the case vr = 1 separately, that 

{l-Yf p 4(l-7r). 



2{N -m) (2 - vr 
and 

p__/ 4(1 -tt). 



a(iV-m,£-y) ^exp ( -^A" ) . (11) 



2-7r 



Combining (10) and (11) we find 



/ \2 

a(m, Y)a{N -m,i-Y)^ exp f (vr + 4(1 - vr)) 

By dominated convergence, using < a{m,Y)a{N — m,l — Y) < 1, this, together with 
(8), yields 

a{N4)-^¥.ia{m,Y)a{N -m,l-Y)) ^ exp A^ - rTr(4-3^) 

^ ' \ [2 — ttY 



exp ( — Y){'^'^ - tt) 



(2-^)2 

which is equivalent to (4) by (5). Hence (4) holds assuming (5)-(6), and thus in general, 
which shows (2). 

Finally, let X' Bin(^,p) be the number of the i chosen edges that appear in 
G{n,p), ignoring orientations. Then, q{£) = E(^)'''' as shown above, and similarly 
q'{i) = ]E(i)^'. In fact, Ea^ < Ea^' for any a > 0, see Hoeffding [9, Theorem 4], 
which, taking a = 1/2, yields q{i) < q'{£). □ 

4. Proofs 

Say that a set 5 C V{G) of vertices in a directed graph G is an outset if there is no 
directed edge vw with v ^ S, w G S (i.e., all edges between S and its complement are 
directed out from S), and an inset if there is no directed edge vw with v G S, w ^ S. 
Hence, S is an outset if and only if its complement is an inset. 

Lemma 4.1. If G is a directed graph and a,b £ V{G), then the following are equivalent: 

(i) ay^b. 

(ii) There exists an inset S with a € S", b ^ S. 

(iii) There exists an outset T with a ^ T , b £ T . 

Proof, (i) <?=^ (ii): No directed path can leave an inset. Hence, if an inset S as in (ii) 
exists, then a -f^b. Conversely, if a y^- 6, then S := \v £ ^(O) : a — )• f } is an inset with 
a^S.b^S. 

(ii) <^=^ (iii): Take T := V{G) \ S, and conversely. □ 

For S Q [n], let Os and Is denote the events {S is an outset} and {5 is an inset}, 
respectively. The events will be in G{n,m) or G{n,p) depending on context. We also 
write, for typographical convenience, Oa ■= C'ja}, etc. 
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Lemma 4.2. Assume that p G (0, 1] is fixed. Then, for any distinct s,a £ G{n,p), 

¥{s A a) = nis) + nOa) + 0{n{l - p/2f^) 
= 2g'(?i-l) + 0(n(l-p/2)2«) 

^2{i-p/2r-\ 

The proof of Lemma 4.2 is very similar to the proof of Lemma 4.3 below, but with 
q{i;n,m) replaced by q'{i;n,p). 

Lemma 4.3. Suppose that ni,n ^ oo, letp = p{n) := 771/(2), (^nd assume \imuiip{n) > 
0. Then, for any distinct s,a £ G{n,m), 

F{s A «) = P(X.) + nOa) + 0(n(l - p/2)2«) 
= 2g(7i-l) + 0(71(1 -p/2)2") 

p(l-p) 

~ 2(1 -p/2)"-ie (5^. 

Proof. Let f'^ be the event that there exists an inset S with s £ S, a ^ S and |5| = k. 
Then, by Lemma 4.1, 

n— 1 n— 2 

{s A «} = U ^'^^ = ^1 u u U ^'^^ (12) 

fc=l fc=2 

Here fi = Is and = Oa- 

If |5| = /c, then F{Is) = F{Os) = q{k{n - k)). In particular, by Lemma 3.2, 

P(T,) = P(a) = q{n - 1) ~ (1 - p/2)""ie" . (13) 
Further, for any k, there are {^ZD sets 5 with s £ S, a ^ S and |5| = k, and thus 

P(ffc) ^ I 1) '?(^(^ - ^)) ^ ^'"''?(^(^ - k)). (14) 
For fixed k we may apply Lemma 3.2 and obtain 

F{Sk) = nSn-k) = 0(n^-i(l -p/2)^(""^')) = 0(7i'=-i(l -p/2)'^"). (15) 

For a fixed -fT, we use this estimate for k < K. For K < k < n — K we have A:(7i — k) > 
K{n — K) and thus q{k{n — k)) < q{K{n — K)), while the total number of subsets S of 
[n] is 2", and thus Lemma 3.2 again yields 

(j £k) <2^q{K{n-K)) = 0{2^{l-p/2f^). (16) 
k=K 

By the assumption liminf j>(7i) > 0, we may assume that p = p{n) > pQ for some pQ > 0. 
Choosing K such that (1 -po/2)^"^ < 1/2, we have 2(1 -p/2)^ < 2(1 -po/2)^"^(l - 
p/2)2 < (1 -p/2)2, and thus (15) and (16) imply 

n-2 K~l 

P(U ^'^) = E 0{n^-\l-p/2f^)+0{{l-p/2f^) = 0{n{l-p/2f^). 

k=2 k=2 
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Hence, (12) yields 

P{s a} = F{£i U £n^i) + 0(n(l - p/2)2") = P(X, u + 0(n(l - p/2)2"). (17) 

Finally, P(Xs n Oa) = q{2n - 3) = 0((1 - p/2)2") by Lemma 3.2 again, and thus 
P(Z, U Oa) = P(2:s) + P(Oa) + C»((l - p/2)2"). The resuh now follows by (17) and 
(13). □ 

Lemma 4.4. Suppose that p € (0, 1] is fixed, then for any distinct s,a,b G{n,p), 
P(s 7^ a and b -/^ s) 

= P(x,,fe n lb) + F{Os,a n Oa) + P(a n ib) + o((i - p/2)2") 

= 3(l-p/2)" + o((l-p/2)2") 

The proof of Lemma 4.4 is almost identical to that of Lemma 4.5 below. 

Lemma 4.5. Suppose that m,n ^ oo, letp = p{n) := 771/(2), cLnd assume Mm. mip{n) > 
0. Then, for any distinct s,a,b [n], 

P(s 7^ a and b -/^ s) 

= P(x,,fe n lb) + P(a,a n Oa) + P(a n ib) + o((i - p/2)2") 

= 3g(2n-3) + o((l-p/2)2") 
~3(l-p/2)2"-3e~^fcF. 
Proof As in (12), we have 

n-l 

{s /> a} = y f^fc with ffc := |J Is- 

k=l S:seS,a(^S,\S\=k 

and similarly 

n-l 

{6 A 4 = U ^ith 4 := U 
First, by the argument in the proof of Lemma 4.3 (choosing K large enough), 

n—3 n—3 

^{\j£k)=^{[j£'k)=0{n\l- p/2)3") = o((l - p/2)2") . 

fc=3 fc=3 

We may thus approximate the event {s -/^ a and b -/^ s} hy 

{£1 U£2U £n-2 U £n~l) H {£[ U £!^ U ^;„2 U 4-1 ) 

IJ Is,, u U a,y n IJ a,, u IJ Xb, 

iXj^a y¥^s I \z^b w^s 

(taking notational advantage of Ig = Is,s and so on). This can be expanded as the union 
of O(n^) events of the type Hj^^j) where A! and y are I or O, and 1 < \Si\, \S2\ < 2. 
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If |S'iU52| > 3, then the event Xs^^ys2 forbids at least |S'iU52|(n- |5iU52|) > 3(n-4) 
directed edges, and Lemma 3.2 shows that 

nXs,^ys,) = 0{{l-p/2f^); (18) 

thus all such combinations have together a probability 0(n^(l— = o((l— 
and may be ignored. 

Furthermore, any event Xg^x'^^Os^z forbids at least n— 3 edges (regardless of orientation) 
in G{n,m), and thus, by Lemma 3.1, 

p(x,,,na,.) < (i-p)"-'. 

Since p > po > 0, we have 

(l-p/2)^ ^ l P + pV4 ^^ 

I — P I — p 

and thus n^(l - p)'"~'^ = o((l — p/2)^("~^)) = o((l — p/2)^'"), so may also ignore ah 
Is,x n Os,z- We may similarly ignore OaCiIb^a, Oa,b<^Ib and Oa,b(^Ib,a- 

This leaves only the events Is^t H lb, H lb-, Oa H It, Oa PI O^, Oa n O^^a. Of these, 
n 2fe C n Zfe and H Os C fl Og^a- Summarizing, we have found 

p(s A a and h ^ s) = p((T,,b nz^) u (0,,^ n a) u (a nX;,)) + o((l -p/2)2"). 

The intersection of any two of the events Xg^b H 2";,, Os,a H Oq and Oa H is contained 
in an event Xs^ n with l^i U = 3, so by (18), its probability is o((l -p/2)2"). 
Hence the first equality in the statement follows. 

Moreover, each of these three events forbids exactly 2n — 3 directed edges, and thus 
each has the probability q[2n — 3). The result follows by Lemma 3.2. □ 

Note the obvious identity F(s -/^ a) = F(6 7^ s). 

Proof of Theorem 2.1. Follows easily from Lemmas 4.2 and 4.4. □ 

Proof of Theorem 2.3. An immediate consequence of Lemmas 4.3 and 4.5. □ 

2p(l-p) 

Proof of Theorem 2.2. Let f{p) := 3e '^^^ -4.+2p. Note that /(O) = -1 and /(I) = 1. 
We will show that f'{p) > for p € [0, 1]; this implies the existence of a unique zero 
Pc € [0, 1] of /, with f{p) < for < p < Pc and f{p) > for pc < P < 1- The result now 
follows from Theorem 2.3. The numerical value pc ~ 0.799288221 is found by Maple. 

We use the partial fraction expansion — p(l — p)/(2 — p)^ = 1 — 3(2 — p)~-^ + 2(2 — p)~^ 
to find 

The factor -3/(2 -p)^ + 4/(2 -p)^ is increasing on [0, 1], as shown by another differen- 

2p(l-p) 

tiation, and is thus at least -3/4 + 4/8 = -1/4. Since < e (^-p)^ < 1, it follows that 
/'(p) > -6/4 + 2 = 1/2 > for p G [0,1]. □ 

Remark 4.6. In the proofs of Lemmas 4. 2 and 4-4) error terms are exponentially 
small compared to the main terms. Consequently, Cov(s -/^ a, b -/^ s)/(l — p/2)^'"~'^ 
approaches its limit 2p — 1 exponentially fast for any fixed p > (this further holds 



+ 2. 
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uniformly for p > po for any po > 0), and similarly, the relative covariance approaches 
its limit in Theorem 2.1 exponentially fast for any fixed p > Q. 

Note, however, that (1) is false in the trivial case p = 0, which shows that the conver- 
gence cannot he uniform for all p > 0. 

Problem 4.7. It would he interesting to know the rate of convergence in G{n,m). 

5. Exact recursions in G{n,p) 

In this section we will give exact recursions to compute 

fn{p) ■■= IPG(n,p)(« A b) and gn{p) := IPG(n,p)(« A 5, s b). 
— > 

For a vertex v € V{G), let C V{G) be the (random) set of all vertices u for which 
there is a directed path from v to u. We will call this the out-cluster from v. Let also 

the in-cluster, C„ C V{G) be the (random) set of all vertices u for which there is a 

directed path from u to v. Note that we will use the convention that v Ci C^. Let 
y := 1 —p/2 be the probability that an edge does not exist with a certain direction, and 
let g := 1 — p be the probability that there is no edge at all. 
For n > 1, s (z X Q [n] and \X\ = k define: 

dp{n,k) :=¥^^^JCs = X), 
where in particular dp{l,l) = 1. 

Lemma 5.1. We have the following recursions 

(i) dp{n, k) = dp{k, k)y^^''~^\ forn>k>l, 

(ii) dp{k,k) = 1 ^jdp{i,i)f^''-'\ and 



i=l 



(iii) fn{p) = Y^{^ _^\dp{k,k)y^(--^\ 



Proof, li n > k there is a vertex w ^ X. The only restriction on w is that it must not 

-> ^ 
have any edge directed from X so F^^^^^{Cs = X) = PG{n-i,p){Cs = X) ■ y and (i) 

— i- 

follows by induction. Clearly Ylx-sexc[n] ^G[np)^^^ ~ ~ ^' '^'^ich gives formula (ii) 
after using equation (i). To get the third equation we sum over all sets X that contain 

s but not b, i.e. /„(p) = J2x^(3[n,p)(^^ = X) = Ylk=i (fc-i)^p(^' which using (i) 
gives (iii). □ 

Note that, by symmetry, also IPg(„p)(Cs = X) = dp{n,k). 

Remark 5.2. Note that by Lemma 2.1 in [12], which is a special case of a theorem by 
McDiarmid, [13], our recursion for dp{k,n) also gives a formula for the probability that 
a given set of vertices X with \X\ = k is the connected component (or open cluster) 
containing s in G{n,p/2). 
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We now want to do something similar for the more comphcated case of gn{p)- For 
n > 1, s e X Q [n], s £ Y C [n] with \X\ = k, \Y\ = m and |[n] \{YUX)\ = r define: 

Mp{n,k,m,r) := IPg(„,p)(C, = X,Cs = Y), 
where in particular Mp(l, 1, 1,0) = 1. 

Lemma 5.3. We have the following recursions for Mp, where k + m > n — r > k,m 
and k,m > 1 

(i) Mp{n,k,m,r) = Mp(n - r, A;, m, 0)g^(''+^'+™-")y''(2"-2^-'=-"^), forr> 0, 

f — t — 1 \ 

(ii) Mp(n,fc,m,0) = j;r^._^ jMp(n-j,fc,m-j,0)(ip(i,jV("-'")- 

^ym+k-n _ yn-k-j^m+k-n^^ij-l)im+k-n)yij-l)in-k-j)^ j^^ n>k,n>m, 

(iii) Mp(n,k,m,r) = Mp{n,m,k,r), 

(iv) Mp{n,n,n,0) = 

l-EL-_i EU-7 E ( _ . M,(n,fc,m,n-m-fc+j), 

j=l ^-^ ^ k=j ^ •' ' m=j ^ •' ' 

n—2n—ln—k+j , ^ 

(v) gn^v) = y]$ZE(-i ■ I. ■ I. 

\3-l,rn-j,k-j,n-k-m+j 

(n — k — l)(n — m — 1) + n — j — 1 

— Mp{n,k,m,n - m-k+j). 

[n — l)[n — 2) 

Proof. Assume, as given for the first equation, that r > 0. All vertices in [n] \ (1" U X) 
must not have any edge directed to Y or from X. This means that there must be no edge 
at all to y nX, which gives probability qHn]\Yux\-\Ynx\ ^ ^r{r+k+m-n) _ n^Yiem must not 
be any edge directed to {Y \ X) and there must not be any edge directed from {X \ Y). 
This gives a factor of ylH\>'ux|.|(y\x)u(x\y)| ^ yr{2n-2r-k-m) ^ 

For equation (ii), note that n> k and r = imply that there exist a vertex w € Y\X. 

— >■ <— 

Let G be any directed graph on n vertices with Cg = X and Cg = Y . If we remove 

vertex w and all its edges from G the resulting graph will still have Cs = X since w ^ X^ 

whereas G^ = Y' , for some Y' such that ynXcy'cy \ {w\. Let j = \Y\ Y'\ and 
sum over all possible Y' . The probability is Mp{n — j,k,m — j,0) that the subgraph 

on [n] \ (y \ y') is as needed. The subgraph on y \ y' must have Gw = Y \ Y' which 
gives probability dp{j,j). There must not be any edge directed from X \ y to y \ Y', 
since the vertices of the latter do not belong to X. The other direction is legal and 
this gives the factor y^"-~"^^^ . There must by the definition of Y' not be any edge at all 
between X nY and Y \ (Y' U {u;}), which gives the factor There must 

also not be any edge directed from Y \ (Y' U {w}) to Y' \ (Y X), which gives the 
factor The only possible edges left to consider have one endpoint in w 

and the other in Y' . The edges between y' \ (y n X) and w could have any direction 
and there must not be any edge directed from Y Ci X to w (since w ^ X), but there 
must be at least one edge directed from w to Y' (since w G Y). This gives probability 
^yTn+k-n . -^n-k-j _ ^m+k-n . yn-k-jy Putting all this together gives formula (ii). 
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The third equation is obtained from the symmetry of reversing all directions. 
The fourth equation follows from the fact that 



X,Y-seY,XC[n] 



Here j = \Y D X\ and recall that s (zY f] X is a necessary condition. 

The last equation is obtained by summing over all possible pairs Y, X such that 
a ^ Y,b ^ X . Again j = \Y r\X\ and the formula is split into the cases when a ^ X and 
a € X, respectively. We get 



We have used Maple to compute the functions /n(p) and gnip) for n < 30. Figure 
1 displays the relative covariance {gnip) — fn{pY) / dnip)- All curves start with being 
mildly negative. They then turn positive and for n < 27 they stay positive. For n > 27 
however, they go below the p-axis again for some time. 

For larger values of n it becomes infeasible for our computers to obtain the exact 
functions. We have instead for various fixed values of p used the recursions to obtain the 
value of the quotient {gn{p) — fn{pY) / gn{p) for n < 300. Based on these calculations we 
conjecture the following. 

Conjecture 6.1. For n > 27, the relative covariance changes sign at three critical 
probabilities pi{n) < P2{n) < pz{n). 

Conjecture 6.2. Asymptotically, pi{n) ci/n and p2{n) ~ C2/n for some constants ci 
and C2- 

The computations indicate that very rough estimates of ci and C2 are 0.36 and 7.5, 
respectively. 

It follows from Remark 4.6 that there is a critical probability p{n) such that p[n) — )> 
1/2 exponentially fast; if our conjectures hold, this critical probability is thus p^in). 

Conjecture 6.3. For all n > 27, ps^n) < 1/2. 



Conjecture 6.4. For n > 8, the relative covariance l^("7^^'^/^|^) ^("^■^) ^(^T^b) ^ 2p^ i ^ 




which after simplification gives the claimed formula. □ 

Note that in G{n,p) the functions P(o -/^ s) and P(a -/^ s,s -/^ b) are polynomials in 
p and hence continuous. 

6. Computations and Conjectures for G{n,p) 
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0.20 n 



0.15 - 



0.10 - 



0.05 - 








Figure 1. The relative correlation fC^A^) G{n,p) for 

n = 8, 10, 12, 14, 16, 18, 20, 22, 24, 25, 26, 27, 28, 29, 30, and the asymptote 
{2p — l)/3. All curves are negative for very small p. For n > 27 we get 
three zeros. 



Note that Conjecture 6.4 implies Conjecture 6.3 and the following related conjecture. 

Conjecture 6.5. For p = ^, and n > 6 the relative covariance '^("7^^'^^^)^^(^'^'^) ^'(''7^^) 
is positive. 



7. Exact recursions in G{n,m) 

For convenience, let yl := {o 7^ s} and B := {s b}. In this section we will derive 
recursions for ¥{A) and ¥{A D B) in m) using the corresponding exact recursions 
in G{n,p) from Section 5. 

Let as above fn{p) := ^G{n,pM) = ^G{n,p)(B) and gn{p) := ^G{n,pM ^ ^"^"^ 
Kim) := Pg(„^^)(^) = ^Gin,m)iB) and fc„(m) := F^^^^^^iAn B). Further, let N := (^) 
be the number of edges in the complete graph Kn and M ~ Bin(A^,p) be the actual 
number of edges in G{n,p). Using that G{n,m) can be seen as {G{n,p) \ M = m), we 

can express the functions fn{p) and gn{p) for G{n,p) using the corresponding functions, 
hn{m) and kn{m), for G{n,m). 
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Up) = P(M = m) • hn{m) = J2 Ijp'^i^-Pf^'" ■ hn{m), 

m=0 m=0 

^ ^ //V\ 

= nM = m) . kn{m) = ^ i)p"'{^-pf~"' ' kn{m). 



m=0 m=0 

These relations can be inverted by repeated differentiating. 
Theorem 7.1. 

1=0 ^ ^ 

Proof. It is sufficient to show one of the recursions. 

N 



fn{p) = Y{^^P'^'^-P)''~'-^n{^ 

loir 



i=0 ^ ^ j=0 
^ ATI 1 

k=Q ^ ^ i=0 ^ 



Differentiating m times and inserting p = gives 



/V' 1 
i=0 



from which we get 

f;(7)(-ir-'./>,.(i) = ^^^/<")(o), 

1=0 ^ ^ 

which, after rearranging, gives the desired recursion. □ 

Computer calculations based on these recursions lead us to the following conjecture. 

Conjecture 7.2. For any fixed n > 5, the covariance in G{n,m) changes sign only once 
between two values of m. 
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8. Comparison between G{n,p) and G{n,m) 

As before, let A := {a -/^ s} and B := {s -/^ h}. For moderate n, the correlation 
between A and B is positive in G{n,p) for quite small p, while, for G{n, m) the proportion 
of links, m/N, where N = Q)) needs to be closer to 1 to get a positive correlation. In 
fact, we may study the conditional covariance given M ^ Bm{N,p) to show that the 
covariance in G{n,p) exceeds the average covariance in G{n,m) for fixed n and p. 



Fact 8.1. 

where 



Cov^(„^p)(^, B) = E{Cov{A, B \ M)) + Var(P(^ | M)) (19) 

N 

E(Cov(A,i? \M)) = Y, P(M = m) • Cov^(„^^)(^, B). 

m=0 

To understand this statement (a standard type of variance analysis) we will as in 
Section 7 view G{n,m) as {G{n,p) \ M = m). Note that 

= E{F{A r\B\M))- K{F{A \ M) F{B | M)) 

+ E(P(A I M)F{B I M)) -E(P(^ I M))E(P(B | M)) 
= E(Cov(A,S I M)) + Cov(P(A I M),F{B \ M)) 

and, as F{A \ M) = F{B \ M), the formula follows. 

The left-hand side of (19) is ~ {2p - 1)(1 - p/2)2"-3 by Theorem 2.1. We can obtain 
the asymptotics of the two terms on the right-hand side too. 

Theorem 8.2. For every fixed p G (0, 1], 

E(Cov(^,5 I A/)) = (^3- (4-2p)e^(^ +o(l))(l-p/2)^""^ 

Var(P(A I M)) = 4(^e^fc)^ - 1 + o(l)) (l - p/2)^""^ 

Note that all three terms in (19), as well as F{A) F{B) = F{A)'^, are of the same order 
(except when a term vanishes), viz. (1 — p/2)^"', see Theorem 2.1 and Lemma 4.2. 

Proof. The sum of the right-hand sides equals {2p — 1 -|- o(l))(l — p/2)^'"~'^, so by (1) 
and (19), it suffices to prove the second formula. 

Let p = M/N = M/(J^^, and note that by the Law of Large Numbers, p — ^ p. We 
begin by observing that by the Central Limit Theorem, 

where denotes convergence in distribution and Z N{0, 1) is a standard normal 
variable. It follows that for any real constants a and b, 

ii-p/2rn+^ = ['-2^p) ^^"pr 2-p ^ ■ 
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By Lemma 4.3, 

^ I \ = 2e +Op(l) =2e +Op(l), 



and thus by (20) 



^ ' ^ ^ 2e exp -^--^-^ —Z . (21) 



(l-p/2)"-i 2-p 
Denote the right-hand side of (21) by R. Since Ke'^^ = e"^^/^ for any real c, we have 

(2 \ P(1-P) 

Ei?'' = 2V 

in particular, ER'^ = Ae (^-p)^ and ER = 2, so Vari? = 4e (^-p)^ _ 4. Hence the result 
follows if the variance converges in (21). For this, it suffices to show that 

(See e.g. [7, Theorems 5.4.2 and 5.4.9] for this standard argument, and note that the 
same argument shows that aU moments converge in (21).) 

To verify (22), let po = p/2. The proof of Lemma 4.3 shows that the estimates in 
Lemma 4.3 (with p replaced by p) hold uniformly for p > pQ. Since q{n — l;n,m) < 
q'{n - l;n,p) = (1 - p/2)"-i by Lemma 3.2, this yields F{A \ M) = 0((1 - p/2)"-i) , 
provided p > pq. For p < pQ, we simply use ¥{A \ M) < 1. Hence, for some constant C, 

The first term on the right-hand side is bounded since ¥{p < pq) = P(M < Npq) < 
]-,y |;]^g Chernoff bound [10, Theorem 2.1]. For the second term we have, letting 
h = 8/{n{2 — p)) and recalling that M is a sum of N independent copies of / ~ Bin(l,p), 

= Eexp(-/i(M-iVp)) = (Eexp(-/i(/-p)))^. 

see [9, (4.16)] (a weaker estimate suffices), and thus the last 
expression is bounded by e^^^^^ = 0(1). This shows (22) and completes the proof. □ 

Numerical computations using the recursions of Sections 5 and 7 also suggest that all 
three quantities in (19), normalized by e.g. (1 — p/2)'^^~^ , converge quickly unless p is 
very small, cf. Remark 4.6; see Figure 2 for the case n = 30. 

9. Quenched version 

As mentioned in the introduction, we have so far studied the annealed model, i.e. 
the joint probability space of G{n,p) (or G{n,m)) and that of the orientations. In the 
quenched model, the covariance is computed for each graph of G{n,p) (or G{n,m)) and 
then averaged over all graphs. 
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Figure 2. The solid curves are the covariance for G{30,p), middle curve, 
which by Fact 8.1 is the sum of the expected covariance for G(30,M), 
bottom, and Var(P(A | M)), top curve. The dashed curves are the as- 
ymptotic curves for G{n,p) (straight line) and expected covariance for 
G{n,M). All curves normalized with 3(1 — p/2)^"~^. 

It is quite common that the results differ between the two models, and this seems to 
be the case here also. We have computed the quenched expectations for G{n,p) (and 
G{n,Tn)) for small n (n < 8) and the covariances, as functions of p, look quite different, 
as can be seen from Figure 3 for n = 8. 




Figure 3. Annealed (top) and quenched (bottom) covariances for G{8,p). 

For n = 4, . . . , 8 there is only one zero for the covariance, but this differs dramatically 
from the zero of the annealed model, as can be seen from Table 1. 
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n 



Annealed Quenched 



4 
5 
6 
7 

8 



1.000 
0.729 
0.276 
0.152 
0.107 



1.000 
0.927 
0.857 
0.809 
0.783 



Table 1. Zeroes of the covariances for G{n,p) in the annealed and 
quenched models. 



Conditioning on the graph and taking expectations, we get that, similarly to Fact 8.1, 



Gov 



Ga{n,p) 



{A,B) = Cov 



Gq(n,p) 



{A, B) + Cov{F{A I G),F{B \ G)), 



(24) 



where Ga{n.,p) denotes the annealed model and Gq{n,p) denotes the quenched model; 
recall that Cov^ ^^^■j{A,B) is defined as W.^ ^^{Cov{A, B \ G)). (A similar formula 

holds for G{n, m) and the corresponding quenched model.) 

Here the conditional probabilities for A and B given the graph need not be equal, so 
that the last covariance in (24) could possibly be negative for some values of p and n. 
Even though our computations show that this is not the case when n < 8. 




Figure 4. Covariances of A and B for G{8,p) (solid) and G(8, m) (o) in 
the quenched model and for G(8, m) (+) in the annealed model. 

It is worth noting that, in contrast with the annealed model, in the quenched model 
G{n,p) and G{n,m) behave similarly, see Figure 4; this is not surprising, since the 
average taken in the quenched G{n,p) can be obtained by averaging over quenched 
G(n, m) with suitable weights. Also the quenched models appear, at least for small 
values of n, to be much closer to annealed G{n,m) than annealed G{n,p). In other 
words, the variation between different graphs with the same number of edges is of less 
importance than the variation caused by different number of edges. (The latter variation 
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is quantified by Theorem 8.2.) This seems intuitively reasonable, since G(n, m) can be 
regarded as G{n,p) conditioned on the number of edges, which thus can be seen as a 
"semi-quenched" version. 

Problem 9.1. It would he interesting to find asymptotics for the quenched versions. 
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